  1.
    Speaker Identification Using Empirical Mode Decomposition-Based Voice Activity Detection Algorithm under Realistic Conditions. R. Kumaraswamy, V. Kamakshi Prasad, Nilabh Kumar Pathak & M. S. Rudramurthy - 2014 - Journal of Intelligent Systems 23 (4):405-421.
    Speaker recognition (SR) under mismatched conditions is a challenging task. The speech signal is nonlinear and nonstationary and therefore difficult to analyze under realistic conditions. Moreover, the nature of the noise present in speech data is not known a priori, so the performance of speaker identification or speaker verification degrades considerably. Any SR system uses a voice activity detector (VAD) as its front-end subsystem. The performance of most VADs deteriorates at the (...)
  2.
    Speaker Verification Under Degraded Conditions Using Empirical Mode Decomposition Based Voice Activity Detection Algorithm. R. Kumaraswamy, V. Kamakshi Prasad & M. S. Rudramurthy - 2014 - Journal of Intelligent Systems 23 (4):359-378.
    The performance of most state-of-the-art speaker recognition systems deteriorates under degraded conditions, owing to mismatch between the training and testing sessions. This study focuses on the front end of the speaker verification (SV) system to reduce that mismatch. An adaptive voice activity detection algorithm using a zero-frequency filter-assisted peaking resonator was integrated into the front end of the SV system. The performance of the proposed SV system was studied under degraded conditions with 50 selected speakers (...)
  3.
    Voice Activity Detection Algorithm Using Zero Frequency Filter Assisted Peaking Resonator and Empirical Mode Decomposition. R. Kumaraswamy, V. Kamakshi Prasad & M. S. Rudramurthy - 2013 - Journal of Intelligent Systems 22 (3):269-282.
    In this article, a new adaptive, data-driven strategy for voice activity detection using empirical mode decomposition (EMD) is proposed. Speech data are decomposed in the time domain using an a posteriori, adaptive, data-driven EMD to yield a set of physically meaningful intrinsic mode functions (IMFs). Each IMF preserves the nonlinear and nonstationary properties of the speech utterance. Among the set of IMFs, the one that predominantly contains source information, called the characteristic IMF, can be identified and extracted by designing a zero-frequency filter-assisted peaking (...)